MSDS 452

Assignment 4

Twitter Networks

The Graph of Co-Occurrent Hashtags (in Tweets) on a certain Date

The Graph of Mentions among Tweeple on a certain Date


Mentions among people resulted in too many edges - I let the next cell for 12 hours, and it only got through the first 3 then crashed

this is because this dataset is a much smaller "community" than the example dataset. The provided COVID set has a max of ~4000 unique tweeple mentioned; this companies set only has a max of 100 - which means it is more interconnected.

I'm going to reduce to a sample to make it more manageable

The graph of mentions among tweeple on 2019-12-16 has 172 nodes and 61371 edges
The graph of mentions among tweeple on 2019-12-17 has 242 nodes and 110449 edges
The graph of mentions among tweeple on 2019-12-18 has 284 nodes and 202622 edges
The graph of mentions among tweeple on 2019-12-19 has 295 nodes and 226202 edges
The graph of mentions among tweeple on 2019-12-20 has 316 nodes and 298395 edges
The graph of mentions among tweeple on 2019-12-21 has 323 nodes and 310966 edges
The graph of mentions among tweeple on 2019-12-22 has 327 nodes and 322064 edges
The graph of mentions among tweeple on 2019-12-23 has 341 nodes and 434267 edges
The graph of mentions among tweeple on 2019-12-24 has 349 nodes and 475948 edges
The graph of mentions among tweeple on 2019-12-25 has 351 nodes and 484423 edges
The graph of mentions among tweeple on 2019-12-26 has 357 nodes and 551368 edges
The graph of mentions among tweeple on 2019-12-27 has 362 nodes and 617550 edges
The graph of mentions among tweeple on 2019-12-28 has 363 nodes and 626133 edges
The graph of mentions among tweeple on 2019-12-29 has 363 nodes and 642636 edges
The graph of mentions among tweeple on 2019-12-30 has 366 nodes and 717960 edges
The graph of mentions among tweeple on 2019-12-31 has 367 nodes and 774880 edges

The Graph of Co-Occurrent Named Entities (in Tweets) on a certain Date